4,379 research outputs found

    L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors

    Full text link
    Language-based colorization produces plausible and visually pleasing colors under the guidance of user-friendly natural language descriptions. Previous methods implicitly assume that users provide comprehensive color descriptions for most of the objects in the image, which leads to suboptimal performance. In this paper, we propose a unified model to perform language-based colorization with any-level descriptions. We leverage the pretrained cross-modality generative model for its robust language understanding and rich color priors to handle the inherent ambiguity of any-level descriptions. We further design modules to align with input conditions to preserve local spatial structures and prevent the ghosting effect. With the proposed novel sampling strategy, our model achieves instance-aware colorization in diverse and complex scenarios. Extensive experimental results demonstrate our advantages of effectively handling any-level descriptions and outperforming both language-based and automatic colorization methods. The code and pretrained models are available at: https://github.com/changzheng123/L-CAD

    PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation

    Full text link
    Aerial Image Segmentation is a particular semantic segmentation problem and has several challenging characteristics that general semantic segmentation does not have. There are two critical issues: The one is an extremely foreground-background imbalanced distribution, and the other is multiple small objects along with the complex background. Such problems make the recent dense affinity context modeling perform poorly even compared with baselines due to over-introduced background context. To handle these problems, we propose a point-wise affinity propagation module based on the Feature Pyramid Network (FPN) framework, named PointFlow. Rather than dense affinity learning, a sparse affinity map is generated upon selected points between the adjacent features, which reduces the noise introduced by the background while keeping efficiency. In particular, we design a dual point matcher to select points from the salient area and object boundaries, respectively. Experimental results on three different aerial segmentation datasets suggest that the proposed method is more effective and efficient than state-of-the-art general semantic segmentation methods. Especially, our methods achieve the best speed and accuracy trade-off on three aerial benchmarks. Further experiments on three general semantic segmentation datasets prove the generality of our method. Code will be provided in (https: //github.com/lxtGH/PFSegNets).Comment: accepted by CVPR202

    A new decay mode of higher charmonium

    Full text link
    We calculate the ΛcΛˉc\Lambda_c\bar{\Lambda}_c partial decay width of the excited vector charmonium states around 4.6 GeV with the quark pair creation model. We find that the partial decay width of the ΛcΛˉc\Lambda_c\bar{\Lambda}_c mode can reach up to several MeV for ψ(4S, 5S, 6S)\psi(4S,~5S,~6S). In contrast, the partial ΛcΛˉc\Lambda_c\bar{\Lambda}_c decay width of the states ψ(3D, 4D, 5D)\psi(3D,~4D,~5D) is less than one MeV. If the enhancement Y(4630)Y(4630) reported by the Belle Collaboration in ΛcΛˉc\Lambda_c\bar{\Lambda}_c invariant-mass distribution is the same structure as Y(4660)Y(4660), the Y(4660)Y(4660) resonance is most likely to be a SS-wave charmonium state.Comment: 8 pages, 4 figure
    • …
    corecore